(N 






The Annals of Probability 

2012, Vol. 40, No. 2, 743-764 

DOI: 10.1214/10-AOP631 

@ Institute of Mathematical Statistics, 2012 

SUBLINEARITY OF THE TRAVEL-TIME VARIANCE 
FOR DEPENDENT FIRST-PASSAGE PERCOLATION 



O ■ By Jacob van den Berg and Demeter Kiss-*^ 

^ . CWI and VU University Amsterdam, and CWI 

.^^ ' Let E be the set of edges of the d-dimensional cubic lattice Z , with 

d > 2, and let i(e),e € E, be nonnegative values. The passage time from 

^^ ■ a vertex « to a vertex w is defined as inf^ : ^-^w X^cgtt *(^)' where the infi- 

mum is over all paths tt from v to w, and the sum is over all edges e of tt. 
Benjamini, Kalai and Schramm [2] proved that if the t(e)'s are i.i.d. 

fs^ ' two-valued positive random variables, the variance of the passage time 

^^ ' from the vertex to a vertex v is sublinear in the distance from to u. 

This result was extended to a large class of independent, continuously 

,J^ ■ distributed i- variables by Benaim and Rossignol [1]. 

j^ , We extend the result by Benjamini, Kalai and Schramm in a very 

different direction, namely to a large class of models where the i(e)'s are 
dependent. This class includes, among other interesting cases, a model 
studied by Higuchi and Zhang [9], where the passage time corresponds 

^vj ■ with the minimal number of sign changes in a subcritical "Ising land- 

^ ' scape.' 

o^: 

QQ . 1. Introduction and statement of results. Consider, for d>2, the d-di- 

(^ I mensional lattice Z,'^. Let E denote the set of edges of the lattice, and let t(e), 

r^ ■ e G E, be nonnegative real values. A path from a vertex u to a vertex w is 

^^ I an alternating sequence of vertices and edges 

o 



Vo = V,ei,Vi,e2,...,Vn-l,en,Vn = W, 

where each Cj is an edge between the vertices fi_i and Vi, 1 < i <n. To 

indicate that e is an edge of a path vr, we often write, with some abuse of 
^ I notation, e G vr. 

5t ■ If V = {vi,...,Vd) is vertex, we use the notation \v\ for X^j^iI^jI- The 

(graph) distance d{v,w) between vertices v and w is defined as |f — -u;!. The 

vertex (0, . . . , 0) will be denoted by 0. 
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2 J. VAN DEN BERG AND D. KISS 

The passage time of a path vr is defined as 

(1) r(vr)=J]t(e). 

egTT 

The passage time (or travel time) T{v,'w) from a vertex ti to a vertex w 
is defined as 

T{v,w) = inf T(7r), 

where the infimum is over all paths vr from v io w. 

Analogous to the above described bond version, there is a natural site 
version of these notions: in the site version, the t-variables are assigned to 
the vertices instead of the edges. In the definition of T(7r), the right-hand 
side in (1) is then replaced by its analog where the sum is over all vertices 
of vr. There seems to be no "fundamental" difference between the bond and 
the site version. 

An important subject of study in first-passage percolation is the asymp- 
totic behavior of T(0,u) and it fluctuations, when l^il is large and the t(e)'s 
are random variables. It is believed that, for a large class of distributions 
of the t(e)'s, the variance of r(0,f) is of order |up^. However, this has 
only been proved for a special case in a modified (oriented) version of the 
model [11]. Apart from this, the best upper bounds obtained for the variance 
before 2003 were linear in \v\ [13]. See Section 1 of [2] for more background 
and references. 

Benjamini, Kalai and Schramm [2] showed that if the t(e)'s are i.i.d. ran- 
dom variables taking values a and h, b>a>0, then the variance of T{0,v) 
is sublinear in the distance from to w. More precisely, they showed the 
following theorem. 

Theorem 1.1 (Benjamini, Kalai and Schramm [2]). Let b>a>0. If the 
(t(e),e € E) are i.i.d. random variables taking values in {a,b}, then there is 
a constant C > such that, for all v with \v\ > 2, 

(2) YaT{T{0,v))<C 



log|f I 



Benaim and Rossignol [1] extended this result to a large class of i.i.d. 
t-variables with a continuous distribution, and also proved concentration 
results. See also [5]. 

We give a generalization of Theorem 1.1 in a very different direction, 
namely to a large class of dependent t-variables. The description of this 
class, and the statement of our general results are given in Section 1.4. 

Using our general results, we show in particular that (2) holds for the 
{a, 6}-valued Ising model with < a < 6 and inverse temperature fi < (3c- By 
{a, 6}-valued Ising model, we mean the model that is simply obtained from 
the ordinary, { — 1, +l}-valued, Ising model by replacing —1 by a and +1 
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by b. The precise definition of the Ising model and the statement of this 
result is given in Section 1.1. 

We also study, as a particular case of our general results, a different Ising- 
like first-passage percolation model: consider an "ordinary" Ising model 
(with signs —1 and +1), with parameters (3 < f3c and with external field h 
satisfying certain conditions. Now define the passage time T{v, w) between 
two vertices v and w as the minimum number of sign changes needed to 
travel from v to w. Higuchi and Zhang [9] proved, for d = 2, a concentra- 
tion result for this model. This concentration result implies an upper bound 
for the variance that is (a "logarithmic-like" factor) larger than linear. We 
show from our general framework that the sublinear bound (2) holds (see 
Theorem 1.5). 

The last special case we mention explicitly is that where the collection of 
t-variables is a finite- valued Markov random field which satisfies a high-noise 
condition studied by Haggstrom and Steif (see [7] ) . Again it follows from our 
general results that the sublinear bound (2) holds (see Theorem 1.4). 

The general organization of the paper is as follows: in the next three 
subsections, we give precise definitions and statements concerning the special 
results mentioned above. Then, in Section 1.4, we state our main, more 
general results, Theorems 1.6 and 1.7. 

In Section 2, we prove the special cases (Theorems 1.2, 1.4 and 1.5) from 
Theorems 1.6 and 1.7. 

In Section 3, we present the main ingredients for the proofs of our general 
results: an inequality by Talagrand (and its extension to multiple-valued 
random variables), a very general "randomization tool" of Benjamini, Kalai 
and Schramm, and a result on greedy lattice animals by Martin [15]. 

In Section 4, we first give a very brief informal sketch of the proof of 
Theorem 1.6 (pointing out the extra problems that arise, compared with 
the i.i.d. case in [2]), followed by a formal, detailed proof. 

The proof of Theorem 1.7 is very similar to that of Theorem 1.6. This is 
explained in Section 5. 

1.1. The case where the t-variables have an {a,b}-valued Ising distribu- 
tion. Recall that the Ising model (with inverse temperature /3 and external 
field h) on a countably infinite, locally finite graph G is defined as follows. 
First some notation: we write v ^ w to indicate that two vertices v and w 
share an edge. For each vertex v of G, the set of vertices {v.w^ v} is de- 
noted by dv. The spin value (+1 or —1) at a vertex v is denoted by a^. 
Now define, for each vertex v and each a € {— 1,+1} , the distribution 
Qv='lW,on{-l,+l}: 

exp(/3(/i-hX;»^^,"«.)) 



??(+!) 



exp(/3(/i + T,wr^v ««')) + exp(-^(/i + E«,--„ "«;)) ' 



(3) 
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exp(/3(/i + X;^^^ aw)) + exp(-/3(/i + Yl^^v "'^)) ' 
Let V denote the set of vertices of G. An Ising distribution on G (with 
parameters /3 and h) is a probability distribution jjjs^h on {—1,+!} which 
satisfies, for each vertex v and each t] G {— 1,+1}, 

(4) ^l|3^h{<^v=r]\a^,,w^v) = q^Bv(^^^^ /i/3,h-a.s. 

In this (usual) setup, the spin values are assigned to the vertices. One 
can define an Ising model with spins assigned to the edges, by replacing G 
by its cover graph (i.e., the graph whose vertices correspond with the edges 
of G, and where two vertices share an edge if the edges of G to which these 
vertices correspond, have a common endpoint). 

In the case where G is the d-dimensional cubic lattice Z*^, with d > 2, it is 
well known that there is a critical value /3c G (0, (X)) such that the following 
holds: if /3 < /3c, there is a unique distribution satisfying (4). If /3 > /3c and 
h = 0, there is more than one distribution satisfying (4). A similar result 
(but with a different value of /3c) holds for the edge version of the model. 

Let 6 > a > 0. An {a,b}-valued Ising model is obtained from the usual 
Ising model by reading a for —1 and b for +1. More precisely, if (cr^,f € y) 
has an Ising distribution and, for each v G V, t{v) is defined to be a if 
a^ = —1 and b if ay = +1, then we say that {t{v),v E V) are {o, 6}-valued 
Ising variables. A similar definition holds for the situation where the spins 
are assigned to the edges. 

A special case of our main result is the following extension of Theorem 1.1 
to the Ising model. 

Theorem 1.2. Let b > a> and d>2. If {t{v),v G Z"*) are {a,b}- 
valued Ising variables with inverse temperature /3 < /3c and external field h, 
then there is a constant C > such that for all v with 1^1 > 2, 

(5) Var(r(0,u))<C 



log|u| 

The analog of this result holds for the case where the values a, b are assigned 
to the edges. 

1.2. Markov random fields with high-noise condition. Let ((T^,f € Z'^), be 
a translation invariant Markov random field taking values in W'^ where W 
is a finite set. Let w G Z"^. For each w gW define (see [7]) 

'jy, = min P((T„ = w\aQy = Vj). 



Further, define 



/ ^ Iw 
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Note that the definition of 7^^ and 7 does not depend on the choice of v. 
Haggstrom and Steif [7] studied the existence of finitary codings (and exacts 
simulations) of Markov random fields under the following high-noise (HN) 
condition (see also [6] and [21]). 

Definition 1.3 (HN condition). A translation invariant Markov ran- 
dom field on Z satisfies the HN condition, if 

2d-l 



7> 



2d 



We will show that the following theorem is a consequence of our main 
result. 

Theorem 1.4. Let d>2 and let {(Jv,v € Z ) be a translation invariant 
Markov random field taking finitely many, strictly positive values. If this 
Markov random, field satisfies the HN condition, then, for the first-passage 
percolation model with t{v) =ay,v S Z*^, there is a constant C > such that 
for all V with \v\ > 2, 

(6) \ar(T(0,v))<C-^. 

log|u| 

The analog of this result holds for the edge version of the model. 

Remark. The HN condition for the edge version is a natural modifica- 
tion of that in Definition 1.3. For instance, the 2d in the numerator and the 
denominator of the right-hand side of the inequality in Definition 1.3 is the 
number of nearest-neighbor vertices of a given vertex, and will be replaced 
by 4(i — 2 (which is the number of edges sharing an endpoint with a given 
edge) . 

1.3. The minimal number of sign changes in an Ising pattern. In Sec- 
tion 1.1, the collection of random variables {t{v),v € Z*^) itself had an Ising 
distribution (with —1 and +1 translated to a, resp., b). A quite different 
first-passage percolation process related to the Ising model is the one, stud- 
ied by Higuchi and Zhang [9] , where one counts the minimal number of sign 
changes from a vertex u to a vertex w in an Ising configuration. 

For /? < /3c, let 0(/3, h) denote the probability that belongs to an infinite 
+ cluster, and let 

h,{f3) = snp{h:e{/3,h) = 0}. 

For d = 2, it was proved in [8] that hc{l3) > 0. 

Using our general results, we will prove (in Section 2) the following ex- 
tension of Theorem 1.1. 
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Theorem 1.5. Let the collection of random variables [cr^jV € Z^) have an 
Ising distribution with parameters f3 < /3c o-nd external field h, with \h\ < he. 
Define, for each edge e = (^1,^2), 

1, if (^vi ^crv2, 



^^""^-{O, if a,. 

For the first-passage percolation model with these t-values, there is a C > 
such that for all v with \v\ > 2, 

(7) Var(r(0,?;))<C 



log|f I 



Remark. Higuchi and Zhang [9] give a concentration result for this 
model (see Theorem 2 in [9]). Their method is very different from ours. [It is 
interesting to note that the paragraph below (1.11) in their paper suggests 
that Talagrand-like inequalities are not applicable to the Ising model.] The 
upper bound for the variance of T{0,v) which follows from their concen- 
tration result is (a "logarithmic-like" factor) larger than linear. For earlier 
results on this and related models, see the Introduction in [9]. 

1.4. Statement of the main results. Our main results, Theorems 1.6 
and 1.7, involve t-variables that can be represented by (or "encoded" in 
terms of) i.i.d. finite-valued random variables in a suitable way, satisfying 
certain conditions. These conditions are of the same flavor as (but somewhat 
different from) those in Section 2 in [20]. 

We first need some notation and terminology. Let 5 be a finite set, and / 
a countably infinite set. Let W^ be a finite subset of I. li x £ S^ , we write xw 
to denote the tuple {xi,i £ W). If /i : S*^ — >■ M is a function, and y € S^ , we 
say that y determines the value of h if h{x) = h{x') for all x, x' satisfying 

xw = a^W = y- 

Let Xj, i G /, be i.i.d. S'-valued random variables. We say that the random 
variables t{v),v G Z'^, are represented by the collection iXi,i G /), if, for each 
V G Z'^, t{v) is a function of {Xi,i G /). The formulation of our main theorems 
involve certain conditions on such a representation: 

• Condition (i) : There exist cq > and eo > such that for each u G Z there 
is a sequence ii (v) , Z2 (v) , • • • of elements of I, such that for all k = l,2,. . . , 

(8) P((Xi^(„), . . . ,Xi^(t,)) does not determine t{v)) < -p^ip^- 

• Condition (ii): 

3q >0 Vw,wGZ'^ V/c <a\v - w\ 
(9) 

{ii{v), . . .,ik{v)} n {ti{w), . . .,ikiw)} = 0. 

• Condition (iii) : The distribution of the family of random variables {t{v) , v G 
Z*^) is translation-invariant. 
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We say that the family of random variables {t{v),v G Z ) has a repre- 
sentation satisfying conditions (i)-(iii), if there are S, I and i.i.d. 5- valued 
random variables Xi,i ^ I as above, such that the t-variables are functions 
of the X-variables satisfying conditions (i)-(iii) above. 

Analogs of these definitions for t- variables indexed by the edges of Z can 
be given in a straightforward way. 

Now we are ready to state our main theorem. 

Theorem 1.6. Let b> a>0, and let, with d>2, {t{v),v e Z'^) be a fam- 
ily of random variables that take values in the interval [a, b] and have a rep- 
resentation satisfying conditions (i)-(iii) above. Then there is a C > 0, such 
that for all f € Z with \v\ >2, 

(10) Var(T(0,7;))<-^. 

log|u| 

The analog for the bond version of this result also holds. 

If the i-variables can take values equal or arbitrarily close to 0, we need 
a stronger version of condition (i) and extra condition (iv) (see below). 

By an optimal path from v to w, we mean a path vr from v to w such that 
T{tt) < T{tt') for all paths vr' from v to w. 

• Condition (i'): There exist cq > 0, eo > and ei > 0, such that for each 
V ^TJ^ there is a sequence i\{v)^i2(v)^ ... of elements of /, such that for 
all A; = 1,2,..., 

(11) P((Xjj(^), . . . ,Xj^(-^,')) does not determine t{v)) < coexp(— sq^^^)- 

• Condition (iv): There exist ci,C2,C3 > such that for all vertices v^w the 
probability that there is no optimal path vr from vtow with |vr| <c\\v — w\ 
is at most C2exp(— csj-y — w|). 

Theorem 1.7. Let 6 > 0, and let, with d>2, {t{v),v G Z'^) be a col- 
lection of random variables taking values in the interval [0,6], and having 
a representation satisfying conditions (i'), (ii), (iii) and (iv) above. Then 
there is a C >0, such that for all v £l, with \v\ > 2, 

(12) Var(T(0,f)<-^. 

logpl 

The analog of this result for the bond version of the model also holds. 

Remarks. 

(a) Note that condition (iii) is in terms of the t-variables only: we do not 
assume that the index set I has a "geometric" structure and that the t- 
variables are "computed" from the X-variables in a "translation-invariant" 
way with respect to that structure (and the structure of Z'^). 
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(b) The goal of our paper is to show that the main result in [2], although 
its proof heavily uses inequalities concerning independent random variables, 
can be extended to an interesting class of dependent first-passage percolation 
models. In the setup of the above conditions (i), (ii), (iii), (i') and (iv), we 
have aimed to obtain fairly general Theorems 1.6 and 1.7, without becoming 
too general (which would give rise to so many extra technicalities that the 
main line of argument would be obscured). For instance, from the proofs 
it will be clear that there is a kind of "trade-off" between conditions (i) 
and (ii) : one may simultaneously strengthen the first and weaken the second 
condition. 

Also, if the bound in condition (i') is replaced by a polynomial bound with 
sufficiently high degree. Theorem 1.7 would still hold (but more explanation 
would be needed in Section 5). Since the main motivation for adding this 
theorem to Theorem 1.6 is to handle the interesting Ising sign-change model 
studied by Higuchi and Zhang [for which we know that condition (i') holds] 
we have not replaced condition (i') by a weaker condition. 

2. Proofs of Theorems 1.2, 1.4 and 1.5 from Theorems 1.6 and 1.7. 

2.1. Proof of Theorem 1.2. In [20], the notion "nice finitary representa- 
tion" has been introduced in the context of two-dimensional random fields. 
See conditions (i)~(iv) in Section 2 of that paper. In Section 2 (see in par- 
ticular Theorem 2.3 in that paper), it is shown that the Ising model with 
/? < /3c has such a representation. (See also [21].) The key ideas and ingre- 
dients are exact simulation by coupling from the past (see [17] and [21]), 
and a well-known result by Martinelli and Olivieri [16] that under a natural 
dynamics (single-site updates; Gibbs sampler) the system has exponential 
convergence to the Ising distribution. The random variables used to execute 
these updates are taken as the X-variables in the definition of a representa- 
tion. 

Condition (ii) in [20] is somewhat weaker than our current condition (i). 
However, as shown in [20] (see the arguments between Theorems 2.3 and 2.4 
in [20]), the above mentioned exponential convergence shows that the Ising 
model satisfies an even stronger bound, namely condition (i') in our paper. 

Condition (iii) in [20] corresponds with our condition (ii), and condi- 
tion (iv) in [20] is stronger than our condition (iii). 

In [20], only the two-dimensional case is treated (because the applications 
are to percolation models where typical two-dimensional methods are used) 
but its arguments concerning "nice finitary representations" for the Ising 
model extend immediately to higher dimensions. 

From the above considerations, it follows that the Ising models in the 
statement of our Theorem 1.2 indeed have a representation satisfying our 
conditions (i)-(iii). Application of Theorem 1.6 now gives Theorem 1.2. 
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2.2. Proof of Theorem 1.4- The argument is very similar to that in the 
proof of Theorem 1.2. Therefore, we only mention the points that need extra 
attention. 

As in the proof of Theorem 1.2, the role of the X-variables in Section 1.4 
is played by the i.i.d. random variables driving a single-site update scheme 
(Gibbs sampler). In Theorem 1.2, a form of exponential convergence for the 
Gibbs sampler was used. This exponential convergence came from a result 
in [16]. In the current situation, the exponential convergence is, as shown in 
Proposition 2.1 in [7], a consequence of the HN condition. This exponential 
convergence implies (again, as in the case of Theorem 1.2) condition (i) [and, 
in fact, the stronger condition (i')] in Section 1.4. Condition (iii) is obvious, 
and condition (ii) follows easily (as in the proof of Theorem 1.2) from the 
general setup of the Gibbs sampler. So, again, we now apply Theorem 1.6 
to obtain Theorem 1.4. 

2.3. Proof of Theorem 1.5. Since (3 < /3c, the collection {ay,v S Z^), has 
(as pointed out in the proof of Theorem 1.2) a representation satisfying 
conditions (i), (ii) and (iii). In fact, as noted in the proof of Theorem 1.2, it 
even satisfies the stronger form (i') of condition (i). Since t(e) is a function 
of the cj-values of the two endpoints of e, it follows immediately that the 
collection (t(e),e G E) (where E denotes the set of edges of the lattice Z^) 
satisfies the (bond analog of) the conditions (i'), (ii) and (iii). The fact 
that (iv) is satisfied follows immediately from Lemma 6 [and (1.9)] in [9]. 
Theorem 1.5 now follows from (the bond version of) Theorem 1.7. 

3. Ingredients for the proof of Theorem 1.6. 

3.1. An inequality by Talagrand. Let S be a finite set and n a positive 
integer. Assign probabilities ps, s £ S, to the elements of S. Let n be the 
corresponding product measure on i} := S"'. 

Let / be a function on 0, and let ||/||i and ||/||2 denote the Li-norm and 
L2-norm of / w.r.t. the measure fi: 

■.= Y,f,{x)\f{x)\; 
xen 



/x(x)|/(x)|2 



zesi 



The notation /j is used for the conditional expectation of / given all 
coordinates except the ith. More precisely, for x = {xi, . . . , x„) € S^ we define 

fi{x) ■.= '^Psf{xi,...,Xi-i,S,Xi+i,...,Xn). 
sGS 
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Further, we define the function Aj/ on ft by 

(13) {Aif){x) = f{x)-fiix), xen. 

Notation AL Remark. Often we work with the alternative, equivalent, 
description that we have n independent random variables, say Zi,...,Z„, 
with P{Zi = s) = ps, s ^ S,l <i <n. To emphasize the identity of the ran- 
dom variables involved, we then often use the notation A^. instead of Aj. 

A key ingredient in [2] and in our paper is the following inequality for 
the case l^l = 2 by Talagrand, a far-reaching extension of an inequality by 
Kahn, Kalai and Linial [12]. 

Theorem 3.1 (Talagrand [19], Theorem 1.5). There is a constant K >0 
such that for each n and each function f on {0, 1}", 



(14) Var(/)<A'log(-^)|: 



1A./||2 



2 



p(l-p)y^log(e||A,/||2/||A,/||i)' 

where (in the notation in the beginning of this section) p = pi = l — po, o-nd 
where Var(/) denotes the variance of f w.r.t. the measure fi. 

In the literature, (partial) extensions of this inequality and inequalities 
of related flavor, to the case |5| > 2 have been given; see, for example, [18] 
and [1]. The following theorem (see [14]) states the most "literal" extension 
of Theorem 3.1 to the case \S\ > 2. (In [14], an extended version of Beckner's 
inequality, a key ingredient in the proof of Theorem 3.1, is used, and the 
proof of Talagrand is followed, with appropriate adaptations, to obtain the 
extension of Theorem 3.1.) To make comparison of our line of arguments with 
that in [2] as clear as possible, it is this extension we will use. (Moreover, 
if instead of Theorem 3.2 we would use the modified Poincare inequalities 
in [1], this would not simplify our proof of Theorem 1.6.) 

Theorem 3.2 ([14], Theorem 1.3). There is a constant K > such that 
for each finite set S, each n G N and each function f on S" the following 
holds: 



(15) Var(/) < Klog(—^ ) V 



1 A ^ IIA./II 



minseSPsJ ^log(e||Ai/||2/||Ai/||i)' 

3.2. Greedy lattice animals. The subject of this subsection played no role 
in the treatment of the first-passage percolation model with independent t- 
variables in [2] , but turns out to be important in our treatment of dependent 
t- variables. 
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Consider, for d>2, the d-dimensional cubic lattice. A lattice animal (ab- 
breviated as l.a.) is a finite connected subset of Z,'^ containing the origin. 
Let X^, u S Z*^, be i.i.d. nonnegative random variables with common distri- 
bution F. Define 

N{n) := max > Xy, 

t-.C l.a. with \C\=n^'^ 

where the maximum is over all lattice animals of size n. 

The subject was introduced by Cox et al. [3]. The asymptotic behavior, 
as n — >■ oo of N{n) has been studied in that and several other papers (see, 
e.g., [4] and [10]). For our purpose, the following result by Martin [15] is 
very suitable. 

Theorem 3.3 (Martin [15], Theorem 2.3). There is a constant C such 
that for all n and for all F that satisfy 



(\-F(x)fl^dx 



< oo. 



'0 

(16) ^(^) - ^ r^^ ~ ^(^))'^''^^- 

Martin [15] says considerably more than this, but the above is sufficient 
for our purpose. 

3.3. A randomization tool. As in [2] we need, for technical reasons, a cer- 
tain "averaging" argument: extra randomness is added to the system to 
make it more tractable. To handle this extra randomness appropriately, the 
following lemma from [2] is used. 

Lemma 3.4 (Benjamini, Kalai and Schramm [2], Lemma 3). There is 
a constant c > such that for every tti G N there is a function 

<7 = 9„^:{0,ir'^{0,l,...,m}, 

which satisfies properties (i) and (ii) below: 

(i) For all i = l,. . . ,m'^ and all x G {0, 1}™ , 

(17) \9m{x^'^)-9mix)\<l, 

where x'*^ denotes the element of {0, 1}*" that differs from x only in the ith 
coordinate. 
(ii) 

(18) maxP(g(y) = fc) <c/m, 

h 

where y is a random variable uniformly distributed on {0, 1}™ . 
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4. Proof of Theorem 1.6. To keep our formulas compact, we will use 
constants Ci, C2, ■ ■ ■ ■ The precise values of these constants do not matter for 
our purposes. Some of them depend on a, b, the dimension d, the distribution 
of the X-variables (in terms of which the t-variables are represented), or 
the constants in the conditions (i), (i'), (ii), (iii) and (iv) in Section 1.4. 
However, they do not (and obviously should not) depend on the choice of v 
in the statement of the theorem. 

4.1. Informal sketch. The detailed proof is given in the next subsection. 
Now we first give a very brief and rough summary of the proof of the main 
result in [2] (listed as Theorem 1.1 in our paper), and then informally (and 
again briefly) indicate the extra problems that arise in our situation where 
the t-variables are dependent. 

Let 7 be the path from to t" for which the sum of the i-variables is 
minimal. (If more than one such path exists, choose one of these by a deter- 
ministic procedure.) Since the value of each i-variable is at least a > and 
at most b, it is clear that the number of edges of 7 is at most a constant c 
times \v\. 

In [2] the t-variables are independent, and Talagrand's inequality (The- 
orem 3.1) is applied with / = T{0,v) and with each i denoting an edge e. 
From the definitions, it is clear that Aj/ is roughly the change of T{0,v) 
caused by changing t{e). Moreover, a change of t{e) can only cause a change 
of T{0,v) if, before or after the change, e is on the above mentioned path 7. 
So, ignoring the denominator in Talagrand's inequality, one gets the (linear) 
bound 



(19) Var(r(0,w))<CiE 



E(^-«)' 



£67 



<c{b-aY\v 



It turns out that, by introducing additional randomness in an appropriate 
way, without changing the variance (see Lemma 3.4), the ||Aj/||2/||Aj/||i 
in the denominator in the right-hand side of Talgrand's inequality becomes 
(uniformly in i) larger than \v\^ for some /3 > 0, thus giving the log|i;| (and 
hence, the sub linearity) in Theorem 1.1. 

In our situation, the underlying independent random variables are the Xi, 
i ^ I (by which the dependent t- variables are represented). Application of 
Talagrand-type inequalities to these variables has the complication that 
changing one X- variable changes a (random) set of possibly many t-variables. 
Taking the square of the effect complicates this further. Nevertheless, it 
turns out that by suitable decompositions of the summations, and by block 
arguments (rescaling), one finally gets, instead of (19) a bound in terms of 
("rescaled") greedy lattice animals which, by the result of Martin in Sec- 
tion 3, is still linear in \v\. 
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To handle the denominator in the Talagr and- type inequahty, we use ad- 
ditional randomness, as in [2]. Again, the fact that changing an X- variable 
can have effect on many i- variables complicates the analysis, but this com- 
plication is easier to handle than that for the numerator mentioned above. 

4.2. Detailed proof. We give the proof for the site version of Theorem 1.6. 
The proof for the bond version is obtained from it by a straightforward, step- 
by-step translation. 

Notation AL Remark. The cardinality of a set V wih be indicated 
by \V\. 

We start by stating a simple but important observation (a version of 
which was also used in [2]). A finite path vr is called an optimal path, or 
a geodesic, if there is no path tt' / vr with the same starting and endpoint 
as vr, for which T{'k') < T[tt). 

Observation 4.1. Since the t-variables are bounded away from and oo, 
there is a constant C2 > such that for every positive integer n and every 
w GZ the following hold: 

(a) Each geodesic has at most C2n vertices in the box w + [—n,n]'^. 

(b) Each geodesic which starts at and ends at w has at most C2\w\ 
vertices. 

Let Xi,i (^ I, be the independent random variables in terms of which the 
variables {t{v),v € Z*^) are represented. So T{0,v) is a function of the X- 
variables. As we said in the informal sketch, we introduce extra randomness, 
in the same way as in [2]: fix m := [|up^J. Let {yl,i = 1, . . . ,m?,j = 1,. . . ,d) 
be a family of independent random variables, each taking value or 1 with 
probability 1/2. The family of y^'s is also taken independently of the X- 
variables. Define, for j = 1, . . . ,d, 

y^ = (2/1, ■•■, 2/^2)- 

Each y^ is uniformly distributed on {0, 1}"^ , and will play the role of the y 
in Lemma 3.4. We simply write Y for the collection {yl,i = 1, . . . ,m'^,j = 
1, . . . ,d) and X for the collection (Xj, i e /). 
Let 

(20) z(y) = (g(yi),...,g(/)) 

with g = gm as in Lemma 3.4. 

To shorten notation, we will write / for T{0, v) and / for the passage 
time between the vertices that are obtained from and f by a (random) 
shift over the vector z{Y): 

(21) f = T{z{Y),v + ziY)). 
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Note that / is completely determined by X, while / depends on X as 
well as Y . 

By translation invariance [see condition (iii)], for every w G Z'^, T(0,u) 
has the same distribution as T{w,v + w). Hence, by conditioning on Y and 
using that Y is independent of the t-variables, it follows that / has the same 
distribution as /. In particular, 

(22) Var(/)=Var(/). 

Theorem 3.2 gives (see the Remarks below) 



Var(/) < Cs Yl 

(23) 

+ C, 



\J^ 



2 



.=l,...,^^,=l,...,.l+l°g(||^^/||2/||^^/||l^ 

E^eI\\^xJg 



■l + min,e/log(||AxJ||2/||AxJ||i) 

Remarks. 

(a) At first sight, Theorem 3.2 is not applicable in the current situa- 
tion where we have two types of random variables: Xj's and y^'s. However, 
by a straightforward argument, "pairing" each variable yf, i = l,...,m^, 
j = 1,. . . ,d, with an independent "dummy" variable Xj (with the same dis- 
tribution as the "ordinary" X- variables), and each variable Xi, i (^ I, with 
an independent "dummy" variable yi (with the same distribution as the "or- 
dinary" y-variables) , it is easy to see that Theorem 3.2 is indeed applicable 
here. 

(b) Note that the statement of Theorem 3.2 is formulated for finite n. 
Combined with a standard limit argument, it gives (23). 

We will handle, in separate subsections, the first term of (23), the numer- 
ator of the second term, and the denominator of the second term. 

4.2.1. The first term in (23). By (20), (21) and (17) it follows that |A jf\ 
is at most a constant C4, so that we have the following lemma. 

Lemma 4.2. The first term in (23) is at most 
(24) <dC4,m^ = dC4\v\^''^. 

4.2.2. The denominator of the second term in (23). In this subsection 
we write, for notational convenience, Aj/ for Ax^/, where z € /. 

If w, w' € 1r we write '^w,w' for the path vr minimizing ^^g^ t{w). If there 
is more than one such path, we use a deterministic, translation-invariant way 
to select one. If w = and w' is our "fixed" f , we write simply 7 for ^qv 
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Recall that z = z{Y) is the random shift. We write 7(2;) for 'jz^v+z- 
Also recall the definitions and notation in Section 1.4. If it; G Z and 

j G /, we say that w needs j if j = ik{w) for some positive integer k, and 

^ii(u))) • • • ^-^ik-i{w) does not determine t{w). 

By a well-known second-moment argument we have, for each j € /, 

(25) mi> ' 



^ifh Jp(Ajf^O) 



Note that, given z{Y) and all Xi,i G /\ {j}, there is a, possibly nonunique, 
s = s{j,X,Y) € S such that / (now considered as a function of Xj only) 
takes its smallest value at Xj = s. Further note that if Ajf^O then, after 
replacing the value of Xj by s, we have Ajf<0. So we get 

F{Ajf< 0) > F{Ajf^O) minP(X,- = r), 

reS 

and hence 

P(A,-/<0) 



(26) P(A,//0)< 



min^gsP(Xj 



Moreover, it follows from the definitions that if Ajf < 0, there is a w 
on 7(2;) such that a certain change of Xj causes a change of t{w). By this 
and (26), we have 



F{Ajfy^O) < C5 V F{w G 'y{z),w needs j) 



(27) 



< C5 y^ min(P(w e 7(z)),P(u; needs j)). 



Ulg 



Recall the definition of ra in the paragraph following Observation 4. 1 . Let 
w gTL'^ and consider the box Bm{w) := w -|- [—m, m]'^. We have 

F{w £ -/{z)) =F{w - z e j). 

By the construction of z, and (17), w — z takes values in the above mentioned 
box Bmiw). Also by the construction of z, and (18), each vertex of the box 
has probability < C^jm^ to be equal to w — z. Moreover, by Observation 4.1 
at most C-jra vertices in the box are on 7. Hence, since 7 is independent of z, 
it follows (by conditioning on 7) that 



(28) F(w£-i[z))<C-jm—,<C^\v 



m 



-(d-l)/4 



Further, by condition (i), we have 
(29) F{w needs j) < ^° 



^^(^•)3d+eo 
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where r^iij) (which we call the rank of j) is the positive integer k for which 
ik{w) =j. 

By (27), (28) and (29), we have, for every K, 

(30) FiA,f^O)<cJ\v\~^'-'y^\{w-.rUj)<K}\+ f^ '^"^gi£ = ^^' ) 

\ k=K I 

Now, condition (ii) implies, for each j € / and each /c > 0, 

(31) |{u^:r^(j)<A:}|<Ciofc'^. 

Hence, the first term between the brackets in (30) is at most 

(32) Ciol^;!"^'^-^)/^/^'^. 

Further, using again (31) (and summation by parts) the sum over k in (30) 
is at most 

(33) ^nEp^T^^^i^^"''""- 

k=K 

Combining (30), (32) and (33), we get 

(34) P(Aj//0) <Ci3(|t;r('^-^)/^i^'^ + i^-2'^-^«). 

Now take for K the smallest positive integer satisfying K > \v\^ ''^ 
and insert this in (34). This gives 

(35) p(Aj//o) < Cl4b^^'^"^)/^ 

which together with (25) yields the following lemma. 

Lemma 4.3. There is a constant C15 > such that for all v ^TL'^ the 
denominator of the second term in (23) is larger than or equal to 

Ci^\og\v\. 

4.2.3. The numerator of the second term in (23), and completion of the 
proof of Theorem 1.6. As in the previous subsection, we write Aj for Axj, 
where j ^ I. 

By the definition of / (and of the norm || • II2), we rewrite 

(36) 5;i|A,/||2 = ^E[(A,r(z(y),z(y) + ^))2]. 

By taking the expectation outside the summation, conditioning on Y 
(and using that Y is independent of the t-variables) and then taking the 
expectation back inside the summation, it is clear that the right-hand side 
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of (36) is smaller than or equal to 

(37) meix^E{{AjT{x,x + v)f). 

We will give an upper bound for the sum in (37) for the case x = 0. Prom 
the computations, it will be clear that this upper bound does not use the 
specific choice of x, and hence holds for all x. 

In the case x = 0, the sum in (37) is, by definition, of course 

(38) y:iiA,-/iii. 






iG/ 

Let X', be an auxiliary random variable that is independent of the X- 
variables and has the same distribution. Let X denote the collection of 
random variables (Xj,i G I), and X' the collection obtained from the col- 
lection X by replacing Xj by X'-. By the definition of Aj/ (and standard 
arguments), we have 

E,-((A,/)2) = lE,-,,[(/(X)-/(X0)'] 
(39) 

= E,,,,[(/(x) - !{x')fi{!{x) < f{x'))], 

where Ej denotes the expectation with respect to Xj, and Ejj/ denotes 
the expectation with respect to Xj and X',. [So, (39) is a function of the 
collection {Xi,i & I,i^ j).] 

Let 7 be the optimal path, as defined in the beginning of Section 4.2.2, 
w.r.t. the i-variables corresponding with the family X . Let li; be a vertex. 
Observe that a change of t{w) does not increase / if u; is not on 7, and 
increases / by at most 6 — a if if is on 7. By this observation, and a similar 
argument as used for (27), we have 

(40) {fix') - f{X))I{f{X) < fix')) <{b-a)Y, I{w needs j), 

U1S7 

and hence 

{f{X)-f{X'))^I{f{X)<f{X')) 

<{h — a)^ 2, H'^ needs j,w needs j). 

Since ||Aj/||| is the expectation w.r.t. the Xi,i^j, of Ej((Aj/)^), we 
have, by (39) and (41), that 



(42) ||A,/||^<(6-a)^E 



> /(n needs j,w needs j) 



■u,uiG7 



To bound the right-hand side of (42), recall the definition [below (29)] 
of r^ij) (with j £ I and w G Z'^), and note that, by condition (i) in Sec- 
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tion 1.4, we have, on an event of probability 1, 
y I{u and w need j) 



u,w(^'y 



(43) 



= ^ ^ /(u and w need j,max(r„(j),r^(j)) = A;) 

OD 

<2^^^/(n and t/; need j,r„(j) =k,r^{j) <k) 

k=l u£fw£^ 



<2^^/(u needs j,r„(j) = k)\{w e-f-rw{j) <k}\- 
k=i u^'r 

By condition (ii), each of the vertices w in the last line of (43) is located 
in a hypercube of length Ci^k centered at u. By this and Observation 4.1, 
it follows that the number of w^s in the last line of (43) is at most Cijk. So 
we have, with Cis = 2Ci7, 

oo 

y I{u and w need j) < Cis /^ k >^ I{u needs j, ru{j) = k), 

u,w£^ k=l uG7 

which, together with (42) [and using the definition of ikiu)] gives, after 
summing over j, 

oo r 

J^||Aj/||i<Ci9^A;E J^/(tx needs ifc(7x)) 



jei 



k=l 



'-uG-y 



(44) 



Cigy^kK y ^I{u needs ik{u)) 

fc=l L^g^ 

+ Ci9 y^ A;E y^I{u needs ik{u)) 



k>\v\ 



'-UG7 



The sum over /c > 1^1 in the right-hand side of (44) can be bounded very 
easily as follows: by Observation 4.1(b), all vertices of 7 are inside the box 
[— C2|u|,C2|w|]'^. Hence, the above-mentioned sum over k > \v\ is at most 

C19 y k y ¥{u needs ikiu)). 

k>\v\ uel-C2\v\,C2\v\]'' 

By condition (i), and since the number of vertices u in this last expression is, 
of course, of order |'y|'^, this expression is smaller than or equal to a constant 
times 

k>\v\ 

which is smaller than a constant C2o- 
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To bound the sum over k < \v\ in the right-hand side of (44), observe that, 
by condition (ii), if a set V C 1i is such that |n — ti'| > C2ik for ah u,u' £V 
with u^u' , then the cohection of random variables 

{I{u needs ik{u)),uG V) 

is independent. With this in mind, we partition, for each k, TL in boxes 

B^(w) := [-[CsiA:], fCsiA:])'^ + 2[C2i^>, w G %^ . 

We will say that two boxes B\^{yS) and By^(u) are neighbors [where u = 
{ui,...,Ud) and w = {wi,...,Wd)] if maxi<j<rf [-Wj - Ui\ = 1. 

By Observation 4.1(a), 7 has at most 6*22^ vertices in each of these boxes. 
Hence, the the sum over /c < [-yl in the right-hand side of (44) is at most 

(45) C23 Y^ k'^E Y^ I{3u G Bk{w) s.t. u needs ik{u)) 

k=l w : (*) 

where (*) indicates that we sum over all w (^7/ with the property that 7 
has a vertex in Bk{w) or in a neighbor of Bk{w). 
Next, partition lA in 2'^ classes, as follows: 

Z^:=z + 2Z'^, zG{0,1}'^. 

So (45) can be written as 

kl r 

(46) C23^A;2 Y ^ ^ /(3nESfc(z + 2«;) s.t. n needs ifc('")) 

where (**) indicates that we sum over all u; € Z with the property that 7 
has a point in Bk{z + 2w) or in a neighbor of Bk{z + 2w). 
Now, for each z € {0, 1}"^, the set 

{w GZ : 7 has a point in -Bfc(z + 2tt;) or a neighbor of Bf^^z + 2w)} 

is a lattice animal and has, for /;;< |u|, by Observation 4.1(b), at most C24:\v\/k 
elements. 

So, from (44)-(46) we get 

\y\ r 

jei k=i zG{o,i}<* '- i«e-C 

(47) s.t. u needs ik{u)) 

+ C20, 
where the maximum is over all lattice animals C with size < C24,\v\/k. 
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Now for each z we have, by the observation below (44), that 

(/(3u e B}^{z + 2vS) s.t. u needs ik{u)),w € Z ) 

is a collection of independent 0-1-valued random variables. For each w, this 
random variable is 1 with probability less than or equal to 

(48) \Bk(z + 2w)\mayiF(u needs ik(u)) < , ,^f^ , 

where we used condition (i). 

By (47), (48) and Theorem 3.3, we get 



j£l fc=l ^ ^ 



1/d 



(49) <C2o + C26|t;|^A;2A;-(3'i+-o)/ci 

fc=l 

<C27\v\. 

Together with (36)-(38), this gives the following lemma. 

Lemma 4.4. The numerator of the second term in (23) is at most C27\v\. 

Lemma 4.4, together with (22), (23), Lemma 4.2 and Lemma 4.3, com- 
pletes the proof of Theorem 1.6. 

5. Proof of Theorem 1.7. The proof is very similar to that of Theo- 
rem 1.6 and we only discuss those steps that need adaptation. 

First, we define, for u,w (z Z'^, the following modification of T{u,w): 

(50) T{u,w) := min T{tt), 

n: M— S>ui,|7r|<ci|u— lii 

where |7r| is the number of vertices of vr, and with ci as in condition (iv). 
From this definition, it is obvious that 

\f{0,v)-T{0,v)\ 

< b{\v\ + l)/(^ an optimal path vr from to f with |7r| < Ciltij). 

By this inequality and condition (iv), we get immediately 

Var(f(i;)) - Var(r(t;)) = o(|t;|/log(|t;|)), 

so that it is sufficient to prove (12) for T{0,v). 

Now, with / = r(0, v) and / = T{z, v + z) [with z = z{Y) as in Section 4] 
the proof follows that of Theorem 1.6, with the following modifications: 
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A few lines above (28) we used that 7 has at most CYm vertices in the 
box Bm{w). In the current situation we have to add, as a correction term, 
the probabihty that 7 has more than Cjm vertices in that box. It follows 
easily from condition (iv) that, with a proper choice of C7, this probability 
goes to faster than any power of m. Hence (recalling the definition of m), 
it is clear that (28) remains true. Therefore, the denominator of the second 
term in the proof of Theorem 1.6 is, in the current situation, again larger 
than a constant times loglt"!. 

A few lines before (44) we applied Observation 4.1(a) (which used the 
fact that all t-values were larger than some positive a) to conclude that the 
number of vertices of 7 in a certain box of length of order k is at most 
some constant times k. In the current situation we do not have this strong 
bound, but we can obviously conclude that this number is at most the total 
number of vertices in the box. Because of this, the k in (44) is, in our current 
situation, replaced by k'^. 

A few lines above (45), we again used Observation 4.1(a). Again we have 
to replace a factor k by k'^. By this (and the previous remark) the k"^ in (45), 
and therefore also in (46) becomes fc^°'. 

By the definition of T, the statement about the size of the lattice animal 
[a few lines above (47)] still holds (with appropriate constants). By this and 
the earlier remarks, we now get (47) with the factor /c^ replaced by k'^'^. By 
condition (i'), the denominator in the right-hand side of (48) is now of order 
exp(eoA;^-^), so that the sum over k in this modified form of (49) is still finite. 

This completes the proof of Theorem 1.7. 
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